NSF PAR Search | NSF Public Access Repository


Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction

Yang, Yue; Panagopoulou, Artemis; Apidianaki, Marianna; Yatskar, Mark; Callison-Burch, Chris (December 2022, Findings of The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022))

Neural language models encode rich knowledge about entities and their relationships, which can be extracted from their representations using probing. Common properties of nouns (e.g., red strawberries, small ant) are, however, more challenging to extract than other types of knowledge because they are rarely explicitly stated in texts. We hypothesize that this is mainly the case for perceptual properties, which are obvious to the participants in the communication. We propose to extract these properties from images and use them in an ensemble model to complement the information extracted from language models. We consider perceptual properties to be more concrete than abstract properties (e.g., interesting, flawless), and we propose to use the adjectives’ concreteness score as a lever to calibrate the contribution of each source (text vs. images). We evaluate our ensemble model in a ranking task where the actual properties of a noun need to be ranked higher than other non-relevant properties. Our results show that the proposed combination of text and images greatly improves noun property prediction compared to powerful text-based language models.
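The abstract does not give the exact combination rule, so the following is only a minimal sketch of one way a concreteness-calibrated ensemble could work. It assumes a simple linear interpolation, per-property scores from a text model and an image model on a comparable scale, and concreteness values already normalized to [0, 1]. The function names `ensemble_scores` and `rank_properties` are hypothetical and do not come from the paper.

```python
def ensemble_scores(text_scores, image_scores, concreteness):
    """Blend text and image scores for each candidate property.

    Assumption: concreteness[prop] is in [0, 1]; a higher value shifts
    weight toward the image-based score, a lower value toward the text score.
    """
    combined = {}
    for prop, text_score in text_scores.items():
        # Clamp the weight so out-of-range inputs cannot flip the blend.
        weight = min(max(concreteness[prop], 0.0), 1.0)
        combined[prop] = (1.0 - weight) * text_score + weight * image_scores[prop]
    return combined


def rank_properties(combined):
    """Return candidate properties sorted from highest to lowest score."""
    return sorted(combined, key=combined.get, reverse=True)


# Toy numbers for the noun "strawberry" (illustrative only, not real data).
text_scores = {"red": 0.2, "interesting": 0.6}
image_scores = {"red": 0.9, "interesting": 0.1}
concreteness = {"red": 0.9, "interesting": 0.1}

combined = ensemble_scores(text_scores, image_scores, concreteness)
ranking = rank_properties(combined)
```

With these toy inputs, the concrete adjective "red" is scored mostly by the image model and the abstract adjective "interesting" mostly by the text model, which is the intuition the abstract describes.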
Self-Supervised Optical Flow with Spiking Neural Networks and Event Based Cameras

https://doi.org/10.1109/IROS51168.2021.9635975

Chaney, Kenneth; Panagopoulou, Artemis; Lee, Chankyu; Roy, Kaushik; Daniilidis, Kostas (January 2021, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))

Visual Goal-Step Inference using wikiHow

https://doi.org/10.18653/v1/2021.emnlp-main.165

Yang, Yue; Panagopoulou, Artemis; Lyu, Qing; Zhang, Li; Yatskar, Mark; Callison-Burch, Chris (January 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

Understanding which sequence of steps is needed to complete a goal can help artificial intelligence systems reason about human activities. Past work in NLP has examined the task of goal-step inference for text. We introduce the visual analogue. We propose the Visual Goal-Step Inference (VGSI) task, where a model is given a textual goal and must choose which of four images represents a plausible step towards that goal. With a new dataset harvested from wikiHow, consisting of 772,277 images representing human actions, we show that our task is challenging for state-of-the-art multimodal models. Moreover, the multimodal representation learned from our data can be effectively transferred to other datasets like HowTo100m, increasing the VGSI accuracy by 15-20%. Our task will facilitate multimodal reasoning about procedural events.
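The four-way choice in the VGSI task can be illustrated with a minimal sketch. It assumes the goal text and each candidate image have already been embedded into a shared vector space by some multimodal encoder, which is not shown here, and that the image most similar to the goal is chosen. The helper names `cosine` and `choose_step_image` are hypothetical; the abstract does not specify the models or the scoring function.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def choose_step_image(goal_vec, image_vecs):
    """Return the index of the candidate image most similar to the goal."""
    sims = [cosine(goal_vec, vec) for vec in image_vecs]
    return max(range(len(sims)), key=sims.__getitem__)


# Toy 2-d embeddings standing in for encoder outputs (illustrative only).
goal_vec = [1.0, 0.0]
image_vecs = [[0.0, 1.0], [1.0, 0.1], [-1.0, 0.0], [0.5, 0.5]]

picked = choose_step_image(goal_vec, image_vecs)
```

Accuracy on the task would then be the fraction of goals for which the picked index matches the correct step image.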
